BIGDATA: Collaborative Research:F: Manifold Hypotheses and Foundations of Data Science with Applications

ثبت نشده
چکیده

It is often said that progress in science is characterized by successive steps of measurement, arithmetization, algorithmization, and algebraization – each step representing in a succinct manner the intuitions collected in the earlier step. In sciences, various breakthrough in technology, e.g., sequencing, high-throughput measurement of DNA/RNA abundance, electron and scanning tunneling microscopy, astronomical studies, space telescopes, collection of social-media data, on-line observations of human interactions, etc., have made it possible to obtain a quantitative arithmetic picture of the “states” of a complex structure (cell, organism, population, social groups, universe) at a certain instant and under certain conditions. As the complexity of the systems studied have scaled, the “bigness” of the data has grown spectacularly; many scalable exact and approximate algorithms have been proposed; a unifying foundational study of the emerging “data science” has become prominent, and yet, it shies away from the final step of the algebraization of data sciences. Such an approach could center around the so-called “Manifold Hypothesis,” that seeks a differential algebraic structure in the state-space – to be inferred from the sampled data point clouds. We wish to build on “Manifold Hypothesis” to create an algebraic (geometric/topological) investigation of existing and emerging Big Data approaches in computer science, statistics, computational science, and mathematics, along with innovative applications in domain sciences, namely, cancer biology, linguistics and the physical sciences that lead towards the further development of the field of data science. Thus, the main emphasis of our study is “Foundational” (F): focusing on fundamental theories, techniques, methodologies, technologies of broad applicability to Big Data problems.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

S-Isomap++: Multi manifold learning from streaming data

Manifold learning based methods have been widely used for non-linear dimensionality reduction (NLDR). However, in many practical settings, the need to process streaming data is a challenge for such methods, owing to the high computational complexity involved. Moreover, most methods operate under the assumption that the input data is sampled from a single manifold, embedded in a high dimensional...

متن کامل

Intake Manifold Flow Assessment on a 3-cylinder Natural Aspirated Downsized Engine Using CFD and GT-SUITE

In this paper the intake manifold as the most effective part on engine’s volumetrice efficiency is in vestigated in detail with emphesizing on flow behavior and characterestics. The eight different design have been  prepared  and  imported to CFD software. Five objective functions for investigation of flow have been connsidered and applied in TOPSIS with different weights. Mass flow rate,massfl...

متن کامل

On a class of paracontact Riemannian manifold

We classify the paracontact Riemannian manifolds that their Riemannian curvature satisfies in the certain condition and we show that this classification is hold for the special cases semi-symmetric and locally symmetric spaces. Finally we study paracontact Riemannian manifolds satisfying R(X, ξ).S = 0, where S is the Ricci tensor.

متن کامل

A NOVEL FUZZY-BASED SIMILARITY MEASURE FOR COLLABORATIVE FILTERING TO ALLEVIATE THE SPARSITY PROBLEM

Memory-based collaborative filtering is the most popular approach to build recommender systems. Despite its success in many applications, it still suffers from several major limitations, including data sparsity. Sparse data affect the quality of the user similarity measurement and consequently the quality of the recommender system. In this paper, we propose a novel user similarity measure based...

متن کامل

International standard "OGC® moving features" to address "4Vs" on locational bigdata

Applications utilizing many types of location data, such as traffic congestion estimation and facility management using indoor pedestrian tracks, have been rapidly increasing. Such applications require the integration of various locational data from different data sources to produce more values. Efforts to ensure smoother data exchange are required for promoting the use of such applications bec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015